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1 . Field of the Invention 

Thft present invention relates to a data search system, 
and in particular to a data structure for search and a method 
of constructing the same, 

2. Description of the Related Art 

Ae a data search mathod, there have been known a table 
^searoii LIiaL uses a Lable for search auU a raai^t aearcU LlicaL ustta 
a tree structure for search. The table search such as a linear 
search or a binary search has a disadvantage that it takes 
increasingly mucn time tor searcn witn increase in tne amount 
of data to be ccarchcd. 

The raaix searcn treats data as strings of symbols selected 
from a limited number of kinds of symbols and uses a tree 
aLrucLure ixi which a seL of Lhe sLriuys of ayiubols claiSbified 
according to a sequential one of symbols starting at the top 
of each string of symbols and each node is linked to several 
others by branches. Each node linked to zero or more children 
Gtorca pointers to zero or more children as many as the number 
Of Kinds included m the strinq of symbols . Provided with searcn 
data consisting of a plurality of symbols, the radix search is 
carried out by following nodes of the tree from a parent to its 
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selected child according to a sequential one symbol of the given 
Search data until a target is loand. 

Taking a lnngA?;t prp.fi v <;ft;^rch method as an example, the 
radix search method will be described in detail. In the longest 

^ prerix searcn metnoa, eacn item ot aata to t>e searcnea is 

represented by a fixed-length (L) or less string of bite and 
nas Its related information. When provided wicn searcn data, 

Q all items of data matching the eearch data are found and on© 

C| iLeiii htaviiiy Lhe luiiyt;:?L iuaLch <iniuny Lheiu ib iielec Led <ab a LaiyeU. 

^ 10 In the case where each item of data to be searched and 

^ its related information as shown in Fig. 1, a binary tree in 

N which the data is stored is shown in Fig. 2. For example, wnen 

Q search data ''OOOllll" is provided, the search is carried out 

irfj t^v foliowmq nodes m tne order presented; Nl, N2, N4, no, 

W 

IS and N7 while reading the related information from a node having 
a node inrormation flag set (indicated by blacked circle) . In 
thi.^ cr^.^fi. Fit thA nndp-.c; N.*^, Nfi , and N7, th*^ T\nc\f^ Information 
flag is set. In other words, the item of data stored in each 
of rnese nodes N3, N6, and N7 matcnes tne searcn daia. Among 

20 them, the related information "D" stored in the node N7 having 

tne lonqest matcn (the lowest level} is oDtalned as a tarqet. 
Assuming that the time required to follow one node is T, 
Lhe maxliiiuiii bearch Lime is 6T in the case ol Fig. 2, 

Tn order tO rerinnp. thp. rippth nf a trpp to shortAD 
25 the maximum search time, N-ary tree structure (N > 2) is usually 
employed. In the case of N=4 Cquad tree structure) , the searcn 
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data is sequentially read in units of two bits. Accordingly, 
an item of data consisting of an odd number of bits is expanded 
to data having a bit length of an integral multiple of 2 so as 
to meet the quad tree structure . Forcxample, 5 bit data "00011" 
having the related information "D" is expanded to two items of 
6-bit datai "000110" and "000111". each having the same related 
Inf unuaLlon ""D". as ahuwii iu Fly. 3. 

In the case where each item of expanded data to be searched 
and its related information as shown in Fig. 3. a quad tree in 
whinh thA data 1 c; storftfl ifi shown 1n Pig. 4. Kach node 'srorfts 
data having n format composed of a node information flag field 
irG, a pointer field having four pointers each corresponding 
branches: "00", "01", no% and "11" to its child nodes, and 
a. relciLed inf urin<aLluu field. The uodti laf uriiiiiLlon Ilag FG is 
set to 1 when storing data and to 0 when stori.ng no data. Tn 
the node N2, for example, the node information flag FG is set 
to 1. The respective four pointers each corresponcLlng to "OU", 
"01", "10", and "11" indicate N3, N4, N5 and NULL, which mean g 

thai the node N2 has three corresponding branches to the 
child nodes N3 , N4 , and N5 and has no branch for "11" , The node 
N2 aLurea Lhe related lufurmaLlua A. 

For example, when search data "0001111" is provided, the 
search is carried out by following nodes in the order presented; 
Ni. N4, ana N7 while reading the reiatea information 
from nodec N2 , N4, and N7 each having the node information flag 
FG aet to 1 (indicated by blacked circle). In this case, the 
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item of data stored in each of these nodes N2 , N4 , and N7 matches 
the search data "0001111". Among them, the relatecl inlormation 
"D" stored 1n the. noria N7 having thft 1ongft5;t mat.nh (thp. lowest 
level) is obtained as a target- Assuming that the time required 
5 to tollow one node is the maximum search time is 4T in the 
cace of Pig. 4. Compared with the binary tree ae shown in Fig. 
2, the quad tree reduces its maximum search time to two-thirds 

Q the maximum search time of the binary tree, 

Huwevcii', if N £(jr N-ary Lret; iii ;?eL Lo a large number so 

f^, 10 as to shorten the. maximum search time, then the numhe.r of nndR.c; 

having the same related Information is increased by expanding 
the data to meet the N-ary tree, m aadition. since the size 

B ol:* one node and the number of pointers included therein are also 

fy increased in proportion to N, an increasing amount of memory 

m : 

Q 15 is needed. 

^ For example, in the case of each node storing 1-bit node 

information flag FG, four 6-bit pointers, and 8-bit related 
information, each node needs 21 bits (- 1 + 6 x 2 -i- 8) for the 
binary tree and y6 bits (=1+6x4+8) for the quad tree. 

20 The total of bite needed in the binary tree ae ehown in Fig . 
2 IS 21(blts ) X 15(nodes) = 315 bits and the total of bits needed 
in the quad tree as shown in Fig. 4 is 33 (bits) x 14 (nodes) = 
462 bits. Assuming that the length of bits per address in 
rH memory .qtnring the (\ritr\ tn he searched 15? 32 hits, 15 (Addresses 

25 are needed for the binary tree but 26 addresses for the quad 
tree . 
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2^5? described above, according to a conventional N-ary tree 
data structure, the number of nodes having the some related 
^ information is increased by expanding the data to mp.ftt t.hft N-ary 
4 tree. In addition, since the size of one node and the number 
j 5 Of pointers incluQed tnerein are also increased m proportion 
to N, an increaeing amount of memory is needed. 



'Zl SUmARY OF THE INVENTION 

'■'■-J 

G 

An object o£ the present invention ia to provide a data 

'"^i search system and a data structure allowing the necessary amount 

O iO of memory to be reduced while maintaining or improving search 

\. performance, compared with the radix searcli mtithoa. 

According to the presAnt invention, a data structure in 
which items of data are stored for search, includes; a tree 
structure in which the items of data are stored except for a 
15 portion of the items of data corresponding to a sub- tree 
structure, which is a selected portion of an assumed tree 
structure formed by all the iteme of data; and an equivalent 
table storiny the pur Lion of Llie items of data in table furai. 
The tree structure may include a plurality of nodes, each 
20 of which is composed of a nod© information flag, a plurality 
of pointers each corresponding to predetermined branches, ana 
related information, wherein each of the pointcro indioatcc one 
of its Child node, tne equivalent table, and null. The 
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equivalent table may include a plurality of entries, each of 
which is composed ul a Ldblt! uudts luformaLluii fleiy, a. Lcili entry 
flag, a riata bit string, a search bit length, and related 
information. 

^ Tne data bit string may be arranged so that a length of 

the data bit string is equal to that of search data, wherein 
the search bit length indicates a length of an original data 
bit string to match with the search data. The entries in the 
tiquivcileiiL Lcible iiiiiy be a Lured at consecutive; lucaLluiis iu 
10 a memory. 

The 5ub-tree structure may be selected so as to satisfying 
f.hft following conrti t.l ons : a) an amount of memory required to 
store the data structure is smaller than that required to store 
the assumed tree structure; and b) search performance of tne 
15 data structure is not lower than that of the assumed tree 
s tr uc Lure . 

ikccording to another- aspect of the present invent-inn, 
a method for constructing a data structure in which items of 
aata are stored tor search, includes the steps of: a) forming 

20 an accumcd tree structure in which all the itcmc of data arc 

stored; b) sequentially selecting a node from tne assumed tree 

structure to select a sub-tree structure designated by the 
StilticLfcid nude; u) furuiliiy an equivaleiiL table b Lorliiy a pux LiOn 
of the items of d^ta rnrre.sponrl i ng to the selected .c;nh-trfte 
25 structure in a table form; d) determining whether the selected 
sub-tree structure satisfies the following conditions: 1) an 
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amnnnt of mp.mnry rp.qiiirftd to sf.orft a data structure including 
the equivalent table in place of the selected sub tree structure 
IS smaller than that required to store the assumed tree 

I 

ctruoturc; and 2) cearch performance of the data structure ie not 
5 lower than that of the assumed tree structure; and e) when LUe 
selected sub-tree structure satisfies the conditions (1) and 
(2), replacing the selected sub-tree structure with the 
equi.va.l.ent table tn rnnstrnrt thft data structure . 

The condition (1) may be that, when the selected sub 

1{) tree structure is replaced with the equivalent table to form 
a new data structure, a maximum ccaroh time Tmax_t calculated 
from the new data structure does not exceed a maxiiuuiii awcii'ch 
time Tmax calculated from the assumed tree structure. The 
condition (2) may be that, when the selected sub- tree structure 

IS i<; rp-plarp.d with the aqnivalent table to form a new data 
structure, a necessary amount of memory for the new data 
structure is smaller than that for the assumed tree structure. 

A dscision on whether the conditions (1) and {2) may be 
b-ciLisfied is iiiadt! dependiuy on wheLher Lhe following equation 

20 is satisfied: 

^ X K, When K = Te/Tn, 
wherA 1 s the number of items of data included in the selected 
sub- tree structure, is the number of levels of the 
selected node or lower in the assumed tree structure, Tn is 
25 search time per node, and T© is search time per entry in the 
equivaleuL table . 
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According to fi.irthp.r another aspect o**^ thp. pre.*? en t 
invention, an apparatus for constructing a data structure in 
wtiicn items of data are stored tor searcn, includes: a tree 



formation ccotion for forming an accumcd tree ctruaturc in which 

all tne Hems of data are stored; a node selector for 
sequentially selecting a node from the assumed tree structure 
Lu belecL a ijub-Lrtse b Lxu(jLurt=; dei^lyuaLed Lhe selecLed node, 
forming an equivalent table storing a portion of the items of 
data corresponding to the selected sub-tree structure in a table 
form, and determining the selected sub-tree structure wJien it 
oatisfica the following conditions; 1) an airiount of memory 
required to store a data structure inciuaing tne equivalent 
table in place* of the selected sub-tree structure is smaller 
Lhaii LhaL required Lu is Luxe Liie a:ab'umed Lree sLrucLur e; and 2) 
sp.arr.h perf nrmrjnnp. of thft 6rita fitrncttirft i ?! nnt 1 owe.r than that 
of the assumed tree structure; and a data structure formation 
section for replacing the selected sub-tree structure 
satisfying the conditions (1) and (2) with the equivalent table 
correspondlnq to the selected sub-tree structure to construct 
the data structure. 

According to still another aspect of the present invention , 
a .qftr^rch .qyfitftm inrlurlftS! a memory storing a data strncture in 
which items of data are stored for search, the data structure 
comDrising: a tree structure m wnicn the items of data are 
ctorcd oxocpt for a portion of the itemc of data corresponding 
Lu a bub-Lree aLrucLure, which is a aelecLed purLiuii uf ciu 
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a.5;«;i]mft(i trftft strnctiirft formed by all thft It.ftms of data; anri 
equivalent table storing the portion of the items of data in 
table form: and a search section for searching the data structure 

i 

for an item of data matching input eearch data- 

The sedrch aystem lUciy further include: a tree furniation 
section for forming an assumed tree structure in which all the 
items ol data are stored; a node selector lor sequentially 
selecting a node, from the ^»ssuTned tree stri.ictnrp. to sp.I p.nt a 
sub- tree structure designated by the selected node, forming an 



• lu equivalent table storing a portion of the Items ot aata 



corresponding to the cclcotcd oub-trcc ctruoturc in a tabic form, 
ana aetermining the selected sub- tree structure when it- 
satisfies the following conditions: 1) an amount of memory 
required Lo store a data structure including the equivalent 
IS table in plane of t.h^. sel e.r.ted sub-tree structure Is smaller 
'r^ than that required to 3tor<5 the assumed tree structure; and 2) 

search performance of the data structure is not lower than that 
of the assumed tree structure; and a data structure formation 
section for replacing the selected sub-tree structure 
20 satisfying the conditions (1) and (2) with the equivalent table 
corresponding to the selected sub-tree structure to construct 
thp riata fitrnr.tnre that IR storftri 1n f.hft memory. 

According to another aspect of the present invention, a 
storage meaium for used in a search system, storing 
25 computer-readable iteme of data to be searched in a data 

bLrucLure, Lhe dciL<i islrucLure cumyriiiiuy ; a tree sLrauLuLe in 
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whi.ch the. Items of data are stored e^xcept for a portion of the 
items of data corresponding to a sub- tree structure, which is 
a seiectea portion ot an assumed tree structure rormecl hy all 
the itemc of data; and an equivalent tabic storing the portion 
of the items of data in table form. 

According to another aspect of the present invenrion, a 
isLurciye aiedluui alurlny a uumpuLer-xeaddblt; pxuyraui fux 
f=^ nnn.strnct i.ng a data structure in which items of data are stored 

for search, the program includes the steps of; a) forming an 
K) assumed tree structure in which all the items of data are stored; 
Q b) sequentially selecting a node from the assumed tree structure 

N! to select a sub-tree structure designated by the selected node; 

3 

□ c) forming an ©quivalsnt table storing a portion of the items 

uf daLa currtsdpuadlug Lo Lhe seleoLed sub-tree structure in a 
IS tahlp. form; d) dfttp.rmining whethp.r thft sp.l ftctf^d snh-f.rp.ft 

structure satisfies the following conditions; 1) an amount 
of memory required to store a data structure including rhe 
equivalent table in place of the selected sub-tree structure 
is smaller than that required to store the assumed tree 
20 structure; and 2) search performance of the data structure is not 
lower than that of the assumed tree structure; and e) when the 
.qftlAOted 5ub-tree strncturA satisfie.? the conditions; (1) and 
(2), replacing the selected sub-tree structure with the 
eauivalent table to construct the data structure. 
25 Ae described above, the equivalent table stores a 

jjluj-allLy of entries wlLhouL Lhe need uf poiuLers Lo child iiodeb 
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as in th© case of a tree structure, resulting in reduced amount 
of luemuxy inquired Lu blurt; Llie ddLei sLxucLure* 

Further, a node, to be replaced with an equivalent table 
is selected on condition that, if the node is replaced with tlit; 
D equivalent table, tnen the maximum search time for a new data 
ctruoturc including the equivalent tabic is not longer than the 
caiculatea maximum search rime and the necessary amount 
of memory is reduced- In other words, when the number of items 
uf delta existing at the nude or lower in the assumed tree 
10 structure is sufficiently small, the node is replaced with the 
equivalent table. Accox-dingly , the present invention provides 
%l a search system allowing the necessary amount ot memory to be 

Q reduced without decreasing in performance compared with the 

pj conventional radix search metnoQ. 

n 

1^ R]?TKF nR?;r.PTPTTON OF THF DRawTNa?; 

Fig. 1 IS a aiagram snowing an example 'of a set or aaia 

to be Gcarohed and correcponding related information; 

Fig. 2 is a diagram showing a conventional binary tree 
structux'o in which the data to be searched are stored; 



2iJ Fig- 3 is c diagram showing a set of expanded data to be 

searched and corresDondmg reiatea intormation. wnerein the 
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expanded data ar^ nht.ainftd by AxpanHing thA data a?: 5;hnwn in 
Fig. 1 to meet a quad tree; 

1 

Fig, d ie a diagram showing a conventional quad tree 
structure in which the expcinded data to he searched are stored; 

Fig. 5 is a schematic block diagram showing a data search 
systftm that 1 mpl Amp.n t?; a fi^ta .^trnctnre according to thft present 
i n vent ion ; 

Fig. 6 ic a diagram chowing an example of a data ctructurc 
for search accordlnq to an first embodiment of the present 
invention; 

Fig. 7 is a flow chart showing a search operation using 
the data structure according to the first embodiment; 

Fig. 8 is a schematic block diagram showing a data search 
iybLeiu having a daLa cuu^'truction section according to the 
15 present invention; 

Fig. 9 is a diagram showing an example of a set of data 
to be searched and corresponding related information; 

Fig. 10 is a flow chart showing a data-structure 
construction operation according to an second embodiment of the 



10 



ii W 
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present invention ; 



Fig. 11 IS a aiagram snowing an example of a tree structure 
formed in a tra© formation section according to tho eecond 
eiubuaiuiwiiL ul LhfcJ pretpfciul iuveiiLiua; and 



Pig. 12 is a diagram showing another example of a data 
structure for search constructed by the second embodiment of 
the present invention. 



U 



DESCRIPTION OF THE PREFERRED EMBODIMENTS 



Referring to Fig. 5, a search system is composed of a 
10 computer 11 , an input device 21 such as a keyboara and a pointing 
device, and an output device 31 such ac a monitor. The computer 
11 implements a search section 12 by running a program thereon. 
A memory 13 is a main memory for a processor to store a data 
structure according to the present invention, 
IS Referring rn V'ig. fi, (iatri structure acrnrd ing to a fir.qr 

embodiment of the present invention is formed by an equivalent 
taoie ana a quad tree usca in tne raaix search, assuming 
the maximum bit length L = 8 bite. In thio example, the data 
structure as shown in Fiq.;6 is obtained by replacing the node 
20 N2 and its child nodes N3 -N7 as shown in Fig . 4 with the equivalent 
toble N2_t . In other words . a set of data represented by the node 
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that is, r^ll thp, datPi innliitip.d in ;^ SAt nf bit string?; p.^oh 
starting with **00" , are stored in the equivalent table N2_t . 

AS Shown in t'ig. 6 , each of the nodes Mi . N8 - Ny-i^Jl4 scores 
information having a format compoeed of a node information flag 

5 field FG, a pointer field havinq four pointers each 

corresponding branches: "00", "01', "10", and "ll" to its 
child nodes, and a related information field. The node 
information flag FC; i s 5;<=it to 1 whp.n storing data tn hft sftarr.hftd 
and to 0 when storing no data to be searched. 

itj in the node Nl, tor example, tne noae intormation tiag 

FG is act to 0 because the node Nl docs not store the related 
information correspondinq to data to be searched. The - 
respective four pointers each corresponding to 00" , '"01", "10", 
and "11" indicate the equivalent table N2_t, NULL, NULL, and 

IS thp. nndft Nfi . Sinr.ft the. nocie Information flag FG is set to 0, 
the node Nl stores an invalid value in the related 

information field thereof. 

In the node N8 , for example, the node information flag 
FG is set to 0 because the node N8 does not sLoru daLa Lu be 

2G s^archod. The respective four pointers each corresponding to 
branches "00", "01"; ''10", and "11" indicate NULL, the node N9 , 
NIIT.r, , and thft node NIO. Since the norte information flag FG is 
set to 0, the node N3 stores an invalid value in the related 
intormation field thereof. 

15 In the node Nil, for example, the node information flag 

FG is seL Lu 1 becaube Lhe node Nil alureb dala Lu be tiearuhed 
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(here, "110100"). All four pointers each corraeponding to "00 " , 
"01'\ "10", and "11" liidludLcj NULL beceiusti Lhe node Nil Is a 
Ip.af J^o6e• Since the node Nil stores the data "110100", the 
related information "E" is stored (see Fig. 3). 
5 *rne eguivaieni: taDie N2jt stores as entries the items of 

data to be £;earched corresponding to the node N2 and its 
child nodes N3-N7 as snown m Fig. 4, Here, the respective 
O entries are located as entries at consecutive -addresses in 

Lhfej memory 13. Eetoh of Lhe enLries stored in the equivalent 

*%! 

f«j 10 table N2_t has a format composed of a 2 -bit table nods 

information flag field FG , a tail entry flag field, a data bit 
String field, a search bit length field, and a related ■ 
information field. 

A 2 -bit table node information flag FG is set to "2" for 
15 all the entries of the table N2_t. A tail entry flag is set to 
"1" for the last eatry aad to "0" lor the other entries . A a triny 
of rJr^trJ bitjs for Rar.h pntry i ?: fnnnp.fi by ariding nnp. nr mnrp. bits 
*'0" to the corresponding item of data to be searched so that 
the bit length of ine corresponding item of data is equal to 

2U the maximum bit length L (L = 8) of ccaroh data. The search bit 

length indicates the number or bits to be searched from tne 
topmost bit. The related information field stores related 
information corresponding to the item of data to be sedrched. 

<qp.Ah!r.H OPRRATTON 

25 In the above- described data structure, vhen branching to 

a node having a node information flag of "u" or "i" , the search 
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is r.arr-iAH nnt hy fnl lowing fhft tree of nodes while sequentially 
checking the search data in units of two bits as the case of 
Fig. 4. When branching lo one having tne node informcition flag 
of "2", the control Ic shifted to the equivalent table N2_t and 
5 the search operation is carried out by simply searching the 
equivalent table N2_t for an entry matching the search data. 
The details ol search operation will be described herealter. 
Q Rflf erring tn Fig. 7, when inputting search data through 

the input device 21, the search section 12 reads the information 
lu ot a root node irom the memory 13 storing tne data structure 
as shown in Fig. 6 {ctcp Al). Then, the ccaroh Gcction 12 
'"■^ determines wnetner tne node information flag FC- or a node m 

O question is "0" or "1" (stepA2). When the node information flag 

PJ FG is "1", the search section 12 stores the related inlormation 

In 

15 stored in the. nodft and a length of matching bits into a memory 
(not shown) (step A3). Here, a length of matching bits is 
dependent on the level of a node in question. For example, in 
a node at a level NL, a length of matching bits is 2 x {NL - 
1) ^ 

20 After the step A3 or when the node information flag FG 

is "0", a pointer corresponding to the first 2 bits of the search 
darn is selected from the tour pointers (step A4) and it is 
determined whether the value of the selected pointer is NULL 
(step Ab). if the value of the selected pointer is NULL (YES 
25 at ctGp AS), then the search section 12 selects the 

longest matching one rrom Lhe related ixif onuaLiou sLored and 
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outpnt.s 1 1. f^Pi a searrth rasnlt. ta th« output dftvl.ne 31 (stftp ^^2) 
and then the search operation is ended. 

If tne value of the selected pointer is not null (no ai 
step AS), then the search section 12 inputs the information of 
5 a biaxich iixdiuaLtfd by Lh« ^iwleuLed yuiiiLer (bleiJ A6). AL Llils 
stage, it cannot be known if the branch Indicated by the selected, 
pointer is a node or the first entry of the equivalent table. 
Therefore, the search section 12 accesses a selected number of 
addresaea of the memory 13 starting with the address indicated 

10 by the selected pointer using one which of the number of 

addreeeee required to store data for a node and the number of 
tiddrttsseb required Lu sLuie daLa for an eiiLry uf Llit; equivaltiuL 
table is greater than the other. In the case where the number 
of addresses required to store data for a node is "2" and 

13 the number of addresses required to store data for an entry of 
the equivalent tabic ic "i", the search section 12 accccccc the 
two addresses of x and where rhe address x is indicared 

by the selected pointer. 

Alter the information of a branch indicated by the selected 

7.0 pointp.r is inpnrtp.r^ («;fp.p Afi), it is dete.rmined whether a norie 
information flag FG is "2" (step A7) . Vihen the node information 
flag IS not "'^^ (NU at: step A7), tne control goes oacx to 
the step A2 and the search is carried out by following the tree 
as indicated by the steps A2-A6. 

25 When the node information flag FG is "2" (YES at step A7 ) , 

it means that the branch is the equivalent table N2_t . 
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Acnnrrii ngl thp. sp.aTnh sp.ctinn ^> Hp.t.p.rTTri np.s whp.tbp.T* the. data 
bit string of a selected entry in the equivalent table 

NZ_t matches the search data with respect to tne search bit 

! 

length thereof (ctop 
s When the data hit string of a selected entry In the 

equivalent table N2_t matches the search data with respect to 
the search bit length thereol (YES at step AB ) , the search 
section 17 «;tr>rp.?; thp. r(=;latpd information i=;torpd in thp. pntry 
in question and a length of matching bits (equal to the 
W Corresponding search bit length] into the memory (not shown) 
(ctcp A9) . 

Arter tne step A9 or wnen tne data bit strlnq of a selected 
entry in the equivalent table N2_t does not match the search 
data with respecL Lo the search bit length thereof (NO at step 
IS Afi) , it 15? HptP.rminpd whpthp.r th*^. p.ntry in qnp.<;tinn i >i the tail 
entry (step AlO). When the entry in question is not the tail 
entry (NU at step AluK the subsequent entry is selected (step 
All) and then the steps A8-A10 are repeatedly performed until 
Lhe tail entry. When Lhe entry in question is the tail entry 
2U (YES at step AlO). the search section 12 selects the 

longest matching one from the related information stored and 
outputs it as a search result to the output device .11 (step A12) 
and then the search operation is ended. 

The search operation will be described taking the data 
25 structure as shown in Fig. 6 as an example. 

Ai>i>uuiiuy LhaL aeaiuh daLa "00011111" lb iupuL Lhruuyh Lhe 
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input aevice 21. tne searcn section 12 reads the information 
of the root node Nl from the memory 13 storing the data structure 
as shown in Flq. 6 and then determines whether the node 
information flag FG of the node Nl is "0" or "1" (stops Al and 
5 A2) . Since the node xnlormation Hag FG is "0" dud Lhe riiiiL 
2 bits of thA cip.arrh ri^^ta i "00", the pointer for "00" 
corresponding to the first 2 bits of the search data is selected 
(step A4J . Since the pointer for "00" indicates the equivalent 
table N2_t, the ccarch ccction 12 inputs the information of the 
'"i 10 first entry stored m tne equivalent tahie N2 t (step A6 ) . 

O SincG thG node information flag FG o£ the first entry is 

tr - 
■J 

S| "2" (YES al iiLep A7 ) , Lhe ^search becLiou 12 deLeruiiueb- whelhei 

Q the data bit etririrj of the first entry in the equivalent table 

N2_t matches the search data with respect to the search bit 
in length thereof (step Aa). . 

In thic example, the data bit ctring of the first entry 
is "00000000" and the search hit length thereof is "2". Since 
the first 2 bits of the search data "00011111" matches that of 
the data bit string "00000000" (YES at step AS), the search 
20 sftotlon 17 smrfts thft rftlatftd informatinn "A" .qf.orftd in rhp.fi r.qt 
entry and a length of matching bits "2" into the memory (not 
shown) (step Ay). Thereafter, the second entry or tne 
equivalent table N2_t is selected (step All) and then the steps 
AS-AIO are repeatedly performed while sequentially selectlnq 
25 the entries of the equivalent table N2_t until the entry in 
question reaches the tail entry. Here, in the cases of the third 



2001 01/25 THU 13:55 03 3288 3222 Ktsuragi Patent OSTROLEM 



FQ5.512 



and fourth entries, the step A8 determines that the first 2 bits 
of the search data "00011111" matches that of the data bit string . 
Therefore, a pair ot the related information "C" stored 1n the 
third entry and a. length of matching bits "4" and a pair of the 
5 reiatea information "D" stored in the fourth entry and a length 
of matching bits "5" are stored in the memory (not shown) (step 



When the entry in question reaches the tail entry (YES 
at step AlO), the search section 12 selects the related 

10 information "D" having the longest matching length and outputs 
it aa a search result to the output device 31 (step A12). 

in this embodiment, a pair of the related information and 
the search bit length is stored in the memory (not shown) at 
Lhti s Lepb A3 and A9 . 

15 Alternatively, the equivalent table N2_t may be previously 

sorted in ascending order of search bit length » In this case, 
only the related information is stored in the memory (not shown) 
at the steps A3 and A9 and the related information stored latest 
is output as a search result to the output device 31. 

21J The major portion of search time is occupied by memory 

access time. Accordingly, it can be assumed that the time 
rp.qnireci to acce?J? onp. p.ntry of thp. pquiVrilAnt table N?._t i 5=: 
substantially equal to the time T required to follow one node 
or tne tree, in tne case where a search result is obtainea from 

25 the equivalent table N2_t, the maximum cearch time ic 5T that 
ib uULdiiied by dddlny Lhe Lime T required to follow thti nudtt 



A9) . 
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NT and thp. timp. dT rp.qnirp.d to sA^irnh all AntriAf^ of i-hp. 
equivalent table N2_t . In the case where the equivalent table 
N2 t is not accessed but one of the nodes Nil-Ni4, the maximum 
search time is 4T as described before. Therefore, the maximum 
5 sedrch time of the embudimeiit is 5T. 

As described above, the equivalent table stores a 
plurality of entries without the need of pointers to child nodes 
as in the case of a tree structure, resulting in. reduced amount 
of memory to store the data structure. For example, the size 

iO of "Che data sirucrure of the Dreseni embodiment as shown in fig, 
6 ic cmaller than that of the conventional data structure as 
shown in Fiq. 4, Each node in the data structure accordlug to 
the embodiment needs a total of 34 bits, which is greater than 
that oJl the conventional data structure as shown in Fig. 4 by 

15 1 hit ber.ansp. Fi node, information flag FG for each node in tne 
embodiment is 2 bita while that of the conventional data 
Structure is 1 bit. in the case where each entry in the 
equivalent table is composed of 2-bit node information flag FG, 
1-blL tall entry flag, a-bit data blL string (L=8), 4-bit search 

20 data length, and 8-bit related information, a total of hits for 
e&ch entry in the equivalent table is 23. Accordingly, when 
45 noaes ana 4 entries are storea as snown m Kig. 6. a total 

of bite in the data structure are 365 ( =• 34 x S + 23 x 4). 

in contrast, since each of 14 nodes needs 33 bits in the 
25 conventional data structure as shown in Fig. 4. a total of bits 
is 462 (= 33 X 14). Therelore, compared with the conventional 
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dat.fl strnntnrA , t.hft prp.<=;ftnt p.mhoriimp.nt. ;^11nws thA 5=:i 7.ft of data 
structure to reduce by about 22%. 

ir'uriher, tne present; emboaiment can reduce tne number of 
addreceee to £5 tore the data ctructure, Accuming that the number 
5 of bits for each address is 32, the conventional data structure 
as shovn in Fig. 4 needs 28 addresses becauso two addresses 
are needed to store data for each of 14 nodes. 

Contrarily, arrnrdi ng to thft prftsftnt embrxi'i ment , 
the number of nodes is 8 , the number of entries in the equivalent 
^ ■'■ 10 table IS 4. tne necessary numcer of aaaresses to store aata tor 
^ , one nod© ic 2, and the ncooccary number of addrccccc to ctore 

SJ data for one entry is 1- Accordlnqly, the necessary number of 

Q'- addresses to store the data structure as shown in Fig. 6 is 20 

(=2x8 + 1x4). Therefore, compared with the conventional data 
\fi struntnrft, thft pre.sftnt embodiment all ows the number of addresses 
in the memory to reduce by about 20%, 

DATA STRUCTURE CONSTRUCTION 
R&f erring to Fig. S, a seareh system is composed of a 
coiupuLer 41, tin inpuL device 51 t>uch as a keyboard and a puinLiiiy 
70 device, an output device 61 such as a monitor, and a memory 71. 
which arc connected to the computer 41. The computer 41 
implements a data construction section 42 and a search section 
43 by running corrc3ponding programs stored in the memory 71 
sucn as a nonvolatile semiconauctor memory , a aisK, or tne liXe. 
25 A memory 44 is a main memory for a processor to store data to 
be searched and corresponding related information, which are 
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used to construct a aata structure according to tne present 
invention . 

The search section 43 has the same functions as the search 
section 12 as shown in Fig. 5. The data construction section 
42 includes a controller 421, a tree lormation section 422 
inrJudIng a roiito. poi nr.o.r rAgistP.r ^2.1, a rp.pl anAd nndA .qp.l Antnr 
424, and an equivalent table formation section 425, The 
controller 4-^1 controls tne operation ot the components m the 
data construction cection 42. 

The tree formation section 422 has a function of forminq 
a tree structure (here, quad tree) used for radix search on 
the meiuory 44 based on the data Lo be searched and Lhe . 
corresponding related information, which are stored in 
the memory 4 4 . 

The replaced node selector ^Z^ nas a tunction ot 
calculating the maximum search time in the tree structure formed 
by Lhe Lree formaLloii sec Lion 422 and further a fuiicLion of 
selecting a node to be replaced with an equivalent table on 
condition that , if the node is replaced with the equivalent table , 
men the maximum search time for a new data structure including 
the equivalent tabic ic not longer than the calculated ma:^imum 
search time ana the necessary amount of memory is reduced. 

Ths equivalent table formation section 425 has a function 
of foriiiiay dii equlvdleiiL Lable from the tree s true Lure uf Lhe 
.qp.lp.r.ted node or lower so that the equivalent table stores all 
items o£ data to be searched included in a set of data indicated 
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by tne selected noae to oe replaced with the equivalent table 
and a function of replacing the tree ctructurc of the 
selected node or lower with the equivalent table. ^ 
Hereafter, the data structure construction operation will ; 
5 be described by referring to Figs. 9-12. 

-Rp^fp.rring to Pig. 10, whAn thp. nnntrnllpr 4.21 inf^trnrt*^ 
the tree formation section 422 to form a tree data structure 
D for radix search, the tree tormation section ^22 forms a tree 

%j data structure ae chown in Fig. 11 on the memory 44 based on 
SI 

]0 the data to be searched and corresponding related information 

!^ as shown in Fig. 9 (step Bl). 

""-I 111 Fly. 11, the lufuriiiciLluii s lurtid In each uf Lhe nudetsi 

O N1-N13 1 s simi 1 to that, in each of the nodes N1-N14 c;hnwn 

Ms 

f|l in Fig, 4. 

LS Thereafter, rne tree rormation section ^22 stores a 

pointer to the root node Nl in the root pointer register 423 
(step B2). When the tree formation operation of the Lree 
formation section ^22 has been completed, the controller 421 
starts the replaced node selector 424. 

2iJ The replaced node selector 424 calculates a maximum search 

time Tma:: for a tree structure of the root node Nl indicated 
by the pointer stored in the root pointer register 4'^3 (step 
B3) , The maximum search time Tmax is calculated by Tn x NLmax, 
whtire Tn is the time required to follow one nude of Lhe tree 

25 and NLmay is a level of a lowermost node in the tree. 

Further, the replaced node selector 424 sequentially 
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selects the nodes of the tree st.riictnrft as shown in Fig. 11 
according to a predetermined rule. For example, the nodes of 
Fiq. 11 are sequentially selected in ascending order of level 
and, wh©n two or mors nodac are found at the same level, 
3 Lhesie uudea are stiqutJiiLially ^selec Led lii branch urder presented : 
"00" . "01" , -10" . and "11" . 

The replaced node selector 424 determines whether a 
O sequentially selected node satisfies the following conditions: 

a) when the sequentially selected node is replaced with an 

'"--J " '■ 

~4 iO equivalent table to form a new data structure, the maximum 

search time Tmax t calculated form the new data structure 

\- lb nut lunger Lhcia Lhe maxiinum sttarch tiiiie Tiiicla calculcitba 

IB. 

O at the step B3; and 

!1 

fij b) when the sequentially selected node is replaced with an 

"01 

^ 1.^ equivalent table to form a new data structure, the necessary 

^ amount of memory for the new data structure is smaller than 

^ 

before the replacemeni . 

A decision on whether a sequentially selected node 
satislios the condition (a) is made depending on, for example, 
9.(1 rhp. foJ. lowing equation: 

^ Nr. X K, when K = Te/Tu, 

Where No is tne numoer ot items ot aara to oe searcnea (wnicn 
is the same ac that of entries in the equivalent table) included 
in a set of data designated by the seQuentially selected node, 
25 is the number of levels of the sequentially selected node 

or lower in the tree structure, Tn is the search time per node. 
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and Te is the search time per entry in the equivalent table. 
If Nq ^ Nl ^ K for Q selected node, then it is determined that 
the selected node satisfies the condition (a) . 

i 

When having found a noda satisfying both tho conditions 
5 (a) and (b), Lhe replactjd iiodti iJeleuLyi 424 uulpulti a pulaLer 
r indicating the found node to the equivalent table formation 
section 425 (step D4). 
Q When the replaced node selection operation ot the 

\i replaced node selector 424 has been completed ac described above, 

.i' It) the controller 421 starts the equivalent tahle forTnation section 

Q 425. 

h • ■ 

Si Upon recfelpt of Lha pointer r indicatiny the found iiudb 

O from the replaced node selector 424, the equivalent table 

frj formation section 425 releases a tree structure of thft found node 

Or? 

1.5 or lower- Thereafter, the equivalent table formation section 



425 cearohes the memory 44 for a oolleotion of data to be searched 
and correspondinq related information, which are included in 
a set of data designated by the found node, and forms an 
equivalent table from the found collection of data and 

20 corrAspontil ng rftlatftd Information (fitftp RS). Than, the 

equivalent table formation section 425 stores the equivalent 
rable formed at the step into the location indicated by the 
pointer r on the memory 44 (ctcp B6). 

The controller 421 may repeatedly start the replaced nude 

25 selector 424 and the equivalent table formation section 425 
arbitrary times. In the case where the replaced node selector 
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424 is Started by the controller 471 two or morft timp.?;, a 
previously selected node is canceled before performing the step 

. Referring to Fig. 12, the tro© etructura of the node N6 

3 ox luwtJJ-' lij replaced wlLh dii equlvuieiiL table N6_L. Thia daLa 
structure is constructed from the tree structure formed by the 
tree formation section 422 by operating the replaced node 
Q : . selector 424 and the equivalent table formation section 42 5 
once. 

SI 

^ > 10 An example of the aata structure construction operation 

P will be described, taking the case of Tn - Te - that is, K 

Si = Te/Tn = 1 as an bxaiuplb , where Tn Is the secirch Lime per Jiode , 

Q ' and Te is the search time per entry in the equivalent table. 

The tree formation section 422 forms a tree data structure 
l^ as shown in Fig. 11 on the memory 44 and stores a pointer to 
the root node Nl in the root pointer regicter 423 (ctepc Bl and 
B2). Thereafter, the replaced node selector 424 calculates 
a maximum search time Tmax for the tree structure as shown in 
Fig. 11 (step B3). Since the lowermost level of the tree 
2t] strnctnrft shown 1n Fig. 11 1fi "4", r.hA mriximnm .cp.rirch timp. 
TmoK for the tree structure is 4T. 

Subsequently, the replaced node selector 424 sequentially 
eelectc the nodec starting with the root node Nl according to 
breadth-first search, if the node Nl is replaced with an 
25 equivalent table, then the number of entries in the equivalent 
table is 8. Therefore, the maximum search time Tmax_t required 



m 
m 
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to searcn a new data structure including the equivalent table 
±z 8T. S:incc Tmax_t = 8T cxoccdc Tmax - 4T, thic node Nl does not 

satisfy tne condition (a), this causes tne replaced node 
selector 424 to select a subsequent nod© N2 . 
J ir Llie node Nl is replaced wlLli au equlvdleaL Ldbltj, Lheii 

r.hp. nnmhAr of Ant.riA<=: in thA Aqnivalent tablA i 4. ThA.rAfnrp., 
the maivimum search time Tma:v_t required to search a new data 

Structure including tne equivalent taoie is wnicn is 



'"4 obtained by adding the search time T for the node Nl to the search 

D 10 rime 4T for ine 4 -entry equivalent tat)le. since Tmax t = 5T 

GxCQGds Tmax = 4T, this noda N2 does not satisfy th© condition 

a 

(d). Thib caui?eb Lhe replaced uude belecLur 424 Lu i>t;leuL 
'f^ a next node N6 because the breadth-first search schAme is 

employed. 

i:i It the node No is replaced witn an equivalent taDie. tnen 

the number of entries in the equivalent table ie 2. Therefore, 
the maximum search time Tmax_t required to search a new data 
structure including the equivalent table is 3T, which is 
obtained by adding the search time T for the node Nl to the search 
20 time 7T for the 7 -entry equivalent table. Since Tmax_t = 3T is 
smaller than Tmax 4T, this node N6 satisfies the condition 
(a) . 

Furthermore, in the case where each node needs 34 bits 
ciiid ecich entry of an equivalent table needs 23 biLi>, the release 
25 of the nodes N6 and its child nodes N7-N9 reduces the number 
of bits by 13C (- 34 x 4) and the number of bits required for 
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tne equivalent table N6.t is 46 ( = 23 x 2J . Tnerefore, formation 
of the equivalent table N6_t achicvca a reduction of 00 bits 
(= 136 - 46), Which means that the noae N6 also satisfies the 
condition (b). The replaced node selector 424 selects the nod© j 
3 N6 aiid ouLpuLs a pointer to Lh© node N6 Lo Lhe equivdleaL Lablt; j 
formation .ciAnri nn 

Upon receipt of the pointer to the node N6 from the 
replaced node selector 4'^i4. the equivalent taDie tormation { 
cection 425 releasee a tree ctructurc of the node N6 and its 

U) Child nodes N7-N9 . as shown in Fiq. 11 , a set of data desiqnated • i 



^ by the node N6 consists of all the bit strings starting with 

"10". Tht; eViuivaleiiL Lablts furiuaLluii 5>ecLiuii 425 iifcjarclitiii 

■a the memory 44 for a collection of data to be searched and 

Q 

1^ corresponding related information, which are included in the 

l.S set ot data designated by the node N6< 

In thic example, the memory 44 ctorec the data to be 
searched and correspondiny related information as shown in Fig. 
9. Therefore, the equivalent table formation section 4.25 finds 
two pairs of data, (1000, E) and (10000, F) , as a collection 
21) of data to be searched anci cnrrftspont^l ng relateri Information, 
which arc included in the act of data designated by the node 
No. The equivalent table formation section 425 forms dara of 
entries for the equivalent table N6_t from the found pairs of 
data (1000, E) and ( 10000, r ) , respectively. Each uf the eiitriei> 
25 stored in the equivalent table N6__t has a format composed of 
a 2-bit table node information flag field FG, a tail entry flag 
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tieia. a data Dit string tieia, a searcn bit lengtn tieid, and 
3 related information ficid. Each data bit ctring ic obtained 
by adding "o' to a corresponding item of data so that the toit 
iength thereof is equal to the maximum bit length of search data. 
The equivalent table N6_t formed as described above ±5 stored 
at f.hft 1 onaM on 1 nrilcar.ftd by the pointer to the. node Nfi on 
the memory 44 . 

AS described above, formation oi an eauivalenr table 
according to the present embodiment can achieves a reduction 
10 of necessary amount of memory without increasing in maximum 
search time. In addition, the present embodiment can reduce 
the uuiiiber of addresses to store the data structure. Assuming 
that the number of hits for each address is 3^., thA r.onvp.nt i nnal 
data structure as shown in Fig. 4 needs 28 addresses because 
Ifi i^ two addresses are needed to store data tor each or 14 nodes. 

2 Contrarily, according to the present embodiment, as shorn 

ill Fig. 12, the number of nodes is 9, the number of eiiLrlea iu 
the equivalent table N6_t is 2, the necessary number of addresses 
to store data for one node is 2, and the necessary number of 
20 addresses to store data for one entry is 1. Accordingly. 

the necessary number of addrc33C3 to store the data structure 
as shown in Fig. 12 is 20 (= 2 x 9 + 1 x 2) . Therefore, compared 
with the conventional data structure, the present embodiment 
tiliowa Lhe number of addreasey in Lhe uiemury Lo reduce by <abuul: 
75 '?9^ . 



